Ranking microbial metabolomic and genomic links in the NPLinker framework using complementary scoring functions
نویسندگان
چکیده
Specialised metabolites from microbial sources are well-known for their wide range of biomedical applications, particularly as antibiotics. When mining paired genomic and metabolomic data sets novel specialised metabolites, establishing links between Biosynthetic Gene Clusters (BGCs) represents a promising way finding such chemistry. However, due to the lack detailed biosynthetic knowledge majority predicted BGCs, large number possible combinations, this is not simple task. This problem becoming ever more pressing with increased availability omics sets. Current tools effective at identifying valid automatically, manual verification considerable bottleneck in natural product research. We demonstrate that using multiple link-scoring functions together makes it easier prioritise true relative others. Based on standardising commonly used score, we introduce new, score an Input-Output Kernel Regression approach. Finally, present NPLinker, software framework link data. Results verified publicly available include validated links.
منابع مشابه
Ranking Based Multitask Learning of Scoring Functions
Scoring functions are an important tool for quantifying properties of interest in many domains; for example, in healthcare, a disease severity scores are used to diagnose the patient’s condition and to decide its further treatment. Scoring functions might be obtained based on the domain knowledge or learned from data by using classification, regression or ranking techniques depending on the typ...
متن کاملRanking and Scoring Using Empirical Risk Minimization
A general model is proposed for studying ranking problems. We investigate learning methods based on empirical minimization of the natural estimates of the ranking risk. The empirical estimates are of the form of a U -statistic. Inequalities from the theory of U -statistics and U processes are used to obtain performance bounds for the empirical risk minimizers. Convex risk minimization methods a...
متن کاملthe use of appropriate madm model for ranking the vendors of mci equipments using fuzzy approach
abstract nowadays, the science of decision making has been paid to more attention due to the complexity of the problems of suppliers selection. as known, one of the efficient tools in economic and human resources development is the extension of communication networks in developing countries. so, the proper selection of suppliers of tc equipments is of concern very much. in this study, a ...
15 صفحه اولRanking from Pairwise Comparisons in the Belief Functions Framework
The problem of deriving a binary relation over alternatives based on paired comparisons is studied. The problem is tackled in the framework of belief functions, which is well-suited to model and manipulate partial and uncertain information. Starting from the work of Tritchler and Lockwood [8], the paper proposes a general model of mass allocation and combination, and shows how to practically de...
متن کاملUsing Wikipedia Categories and Links in Entity Ranking
This paper describes the participation of the INRIA group in the INEX 2007 XML entity ranking and ad hoc tracks. We developed a system for ranking Wikipedia entities in answer to a query. Our approach utilises the known categories, the link structure of Wikipedia, as well as the link co-occurrences with the examples (when provided) to improve the effectiveness of entity ranking. Our experiments...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: PLOS Computational Biology
سال: 2021
ISSN: ['1553-734X', '1553-7358']
DOI: https://doi.org/10.1371/journal.pcbi.1008920